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Abstract 

We have combined recently published leptoquark results from the CDF and D0 
Collaborations which yielded 95% CL lower limits on the first generation scalar lep- 
toquark mass of 213 GeV and 225 GeV, respectively, under assumption of 100% 
branching fraction of the leptoquark decay into the eq channel. The combined limit 
from the two experiments is 242 GeV. This is the most stringent limit on the first 
generation scalar leptoquark mass to date. 

1 Introduction 

Recently, the D0 and CDF Collaborations at the Fermilab Tevatron have both pub- 
lished m limits on the pair production of the first generation scalar leptoquarks that 
ruled out an interpretation of the HERA high-Q 2 event excess reported by the HI and 
ZEUS Collaborations |3], |4]] as an s-channel production of leptoquarks with 100% branch- 
ing fraction to the charged lepton channel (eg). D0 set a 95% confidence level (CL) lower 
limit of 225 GeV on the mass of such a leptoquark; the analogous CDF limit is 213 GeV. 

In this paper we discuss a combination of the results of the two experiments, using both 
Bayesian and traditional frequentist approaches, that results in a tighter limit of 242 GeV. 
This is the most restrictive limit on the leptoquark mass to date. 



2 Individual results 

The experimental results from both experiments are summarized in Table |l|. In the region 
of interest (leptoquark mass M > 200 GeV) neither experiment observes any candidate 
events. 
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Table 1: Individual results from the CDF and D0 experiments 



Quantity 


CDF 


D0 


Number of candidates 








Background 


N/A 


0.44 ±0.06 


Efficiency 


28% ±11% 


36% - 39% ± 12% (M > 220 GeV) 


Integrated luminosity 


110 ± 8 pb- 1 


123 ± 6.5 pb" 1 



To obtain a combined limit, we need to understand the correlations between the two 
experiments. We have considered a number of systematic errors for different parameters 
used in the calculation of the integrated luminosities, efficiencies, and backgrounds for each 
of the experiments. Central values of these parameters and fractional errors in each of them 
are summarized in Table Let us discuss some of these errors in more details. 

CDF determines the integrated luminosity based on its own measurements of the in- 
elastic, single and double diffractive cross sections. The D0 calculation of the integrated 
luminosity is based on both the CDF |j| and E710 || inelastic, single diffractive, and dou- 
ble diffractive cross section measurements. Since the two measurements of the inelastic 
cross section differ by nearly two standard deviations, a x 2 -based factor of 1.85 is used to 
scale the errors in the inelastic cross sections of the two experiments (see for details). 
The single and double diffractive cross section measurements are in a good agreement and 
therefore no scaling of the errors quoted by either experiment was done for these cross 
sections. For simplicity of calculation we use here just the average of single and double 
diffractive cross sections for both the CDF and D0 luminosity calculation (CDF in fact 
uses just their own measurement but numerically it does not affect the results at all). We 
further neglect the error on the double diffractive cross section since it is small compared to 
the errors in the inelastic and single diffraction cross sections. The CDF- and DC-specific 
luminosity errors are then calculated by subtracting in quadrature the error due to the 
cross section measurements (based on each experiment's approach) from the total quoted 
errors of 7.2% (CDF) and 5.3% (D0). 

The other source of common systematics is the error in efficiency due to the MC mod- 
elling of the signal, dominated by the uncertainties due to parton distribution functions and 
gluon radiation. This fractional 10% error is assumed to be completely correlated between 
the two experiments and folded in the efficiency for each experiment as a 1.0 ±0.1 factor. 
Then the CDF- and DC-specific efficiency errors are obtained by subtracting the common 
10% error in quadrature from the overall quoted 11% (CDF) and 12% (D0) efficiency 
errors, which gives the experiment-specific errors of: 4.9% (CDF) and 6.5% (D0). 

3 Bayesian Approach 

In the Bayesian aproach we will define probability functions for each experiment in such 
a way that they take into account correlated and uncorrelated uncertainties. We assume 
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Table 2: Integrated luminosity, efficiency and background for CDF and D0 leptoquark 
searches. 



Parameter 


Center value 


Fractional uncertainty 


CDF inelastic c.s. 


60.33 pb 


2.3% 


E710 inelastic c.s. 


55.5 pb 


4.0% 


Average single diffractive c.s. 


9.54 pb 


4.5% 


CDF-specific luminosity error 


110 pb" 1 


6.6% 


DC-specific luminosity error 


123 pb" 1 


2.6% 


Common MC modelling factor 


1.00 


10.0% 


CDF-specific efficiency error 


0.28 


4.9% 


DC-specific efficiency error 


0.38 


6.5% 


CDF background 


N/A 


N/A 


D0 background 


0.44 


13.6% 



Gaussian errors in all the parameters. 

We start by determining the 95% CL upper limits from each experiment independently, 
i.e. reproducing published numbers. 

Since neither experiment observed any candidate events in the region of interest, the 
likelihood function of each measurement is given by Poisson distribution with given ex- 
pected signal and background means: 

P(n = | S(C, e), B, I) = e~ {s+B \ (1) 

where S(C,e) is the expected signal for a given integrated luminosity C and efficiency e, 
B is the expected background, and / represents other prior information. For a given cross 
section of the leptoquark pair production a we have: 

S{£,e)=ae£. (2) 

We can now apply Bayes' theorem and write down the expression for the posterior proba- 
bility for the cross section cr, given the observation of zero events in the data: 

. JdedCdBPjn = \aCe, B, I)p(e\I)p(C\I)p(B\I)p(a\I) 
P[CJln ' } J dadsd£p(£\I)p(C\I)p(B\I)p(a\I) ' [ ) 

where p(e\I), p(£\I), p(B\I) are prior probability densities for efficiency, integrated lumi- 
nosity and backgrounds, and are Gaussian by assumption. Finally, p{cr\I) is the prior for 
the signal cross section, and since the most basic assumption about the signal is that it 
cannot be negative, but otherwise can be anything, a natural choice for the signal prior is 
p(cr\I) = 0(a), where 9{x) is ^-function, defined as for x < and 1 for x > 0. 

An efficient way to calculate the integral (JJ) is to use a Monte Carlo (MC) integration 
by generating random values of C, e an B according to their Gaussian priors. The value 
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of the integral is simply the average value of p(n = \aCe, B, I) obtained in the series of 
the MC trials, since by definition probability density functions are normalized to unity. 

We then vary the input value of a for the MC trials to obtain P(a \ n — 0, I) in the 
entire range: < o < oo. The upper 95% confidence level limit on signal cross section, 
cr 95 , can then be obtained by solving the following integral equation: 

/ P(a\n = 0, 1)da = 0.95 / p(a | n = 0, 1)da. (4) 
Jo Jo 

(Here we have to normalize the posterior probability to unity since p(cr\I) = 0(a) is not 
properly normalized.) 

From the Eq. (JJ) it is natural to expect that p(a \ n = 0,1) can be parameterized as 
Ae~ acr . In this case equation (|j) transforms into: 

exp(-acx 95 ) = 1 - 0.95 = 0.05, 

which can be easily solved: 

a 95 = iln20 = 3.00/a. (5) 
a 

We can now apply this approach independently to the CDF and D0 measurements. The 
only subtlety here is that the efficiency e generally depends on the leptoquark mass, and 
thus far we have not made any connections between the mass and the cross section. One 
can obtain a more complex two-dimensional function p(a,M | n = 0,1) and then obtain a 
cross section limit for any given leptoquark mass. We however, will use the fact that for 
the leptoquark masses above 200 GeV the efficiency changes very slowly, and we simply 
use the efficiency measured at the published value of the mass limit for each experiment in 
order to reproduce the results. When calculating the limits for each particular experiment, 
we do not care about correlated and uncorrelated errors, and therefore can simply use 
overall uncertainties on C, e and B for each experiment. In the case of CDF, there was 
no background estimate made, and therefore no background subtraction was used when 
obtaining cross section limits. This fact, however, does not influence the limit since it is 
well known || that for the case of zero observed candidate events, the limit on the signal 
does not depend on the actual value of the background or its uncertainty. We therefore 
assign background B = 0.00 ± 0.00 events in the CDF case. 

The estimates of C, e and B for each of the experiments are summarized in Table |3|. 
A posterior probability function for D0 experiment is shown in Fig. [3], together with an 
exponential fit. The fit has essentially y 2 = 0, so indeed the assumption of exponential 
approximation works well. Using Eq. (||) we obtain the following 95% CL upper limits on 
the production cross sections: 

a 95 DF = 0.0993 pb (6) 
a 95 = 0.0663 pb, (7) 

which can be translated into the lower mass limits on the first generation scalar leptoquarks 
using parameterization of the lower band of next-to- leading order cross section 0: 

M > 213.5 GeV (CDF) (8) 
M > 225.8 GeV (D0), (9) 
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Table 3: Integrated luminosity, efficiency and background for CDF and D0 leptoquark 
searches. 



Parameter 


CDF 


D0 


C 


110 ±8 pb- 1 


123 ± 7 pb" 1 


e 


(28.0 ±3.1)% 


(38.0 ±4.6)% 


B 


N/A 


0.44 ±0.06 



xYndf 0.3089E-03/ 19 

PI -0.4494 ± 0.8477 

P2 -45.18 ± 31.14 



c yj (D0) 



i I — I ' i ■ I 
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Figure 1: Posterior probability p(a \n = 0, /) for D0 experiment and the corresponding 
95% CL cross section upper limit. The line shows a fit to the exponential function. The 
arrow indicates 95% CL cross section upper limit. 

in agreement with the published values |2|, []J . 

Having reproduced the individual results of each experiment, we can now combine them 
by breaking down the errors of each of the experiments into several pieces and by using 
one and the same random values for the correlated uncertainties, and different values for 
the uncorrelated ones during the MC integration. 

For the combined results from the two experiments we again have zero candidates 
observed, so the only required modification to the Eq. (fj) is that the signal expectation is: 

S = a{e 1 C 1 +e 2 C 2 ), (10) 

where indices 1 and 2 correspond to the CDF and D0 experiments, respectively. The 
overall background expectation is still the same, since we use no background for the CDF 
case. The rest of the formalism does not change, except that now we integrate not over 
dCdedB, but over all the individual Gaussian uncertainties used in the joint analysis (see 
Table D. 
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Figure 2: Posterior probability p(cx \n = 0, I) for combined CDF and D0 experiments and 
the corresponding upper 95% CL cross section limit. The line shows a fit to the exponential 
function. The arrow indicates 95% CL upper cross section limit. 

After performing the MC integration we obtain the posterior probability function for the 
CDF and D0 measurements, as shown in Fig. ^, which is well described by an exponential. 
The following 95% CL limits on the production cross section and the leptoquark mass are 
obtained: 

a < 0.0383 pb (11) 
M > 242.4 GeV. (12) 

This is the final result of the Bayesian combined analysis. 

4 Frequentist Analysis 

In the classical or frequentist approach, the upper limit at a confidence level a is defined such 
that, when the experiment is repeated a large number of times, the fraction of experiments 
that observe a value smaller or equal to the measured one is less or equal to a, if the true 
value of the quantity is larger than the upper limit. If the number of observed events is 
small they will be distributed according to Poisson statistics, P(n; /i) = e ~ f* , where n 
is the number of observed events and /i = Cue is the most probable number of events 
produced. £ is the integrated luminosity, e is the efficiency, and a is the production cross 
section. If we have two measurements of the same quantity, the total likelihood of observing 
n\ and ri2 events from the same source is: 

P(nx,n 2 | /f) = P{n x \ At)P(n 2 | /V). (13) 

To account for systematic uncertainties on the efficiency and luminosity, the Poisson 
probability is convoluted with the probability distributions of e and £, assumed to be 
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Gaussians. Then the probability of observing a number of events rij in experiment i is: 

P(n 4 | Csifji) = [ P{ni | Le\ii)G{e\\e u a i )de' i , (14) 
J o 

where | £j, <Jj) indicates the sampling of the efficiency with a Gaussian distribution of 
mean and sigma <7j. 

In combining two experiments the uncertainties in the efficiencies are decomposed in 
two parts: those correlated between the two experiments, and the uncorrelated ones, as 
indicated above. The total likelihood for observing a set of events in the two experiments 
is: 

2 

£ = Y[P(ni | CiEin), 
i 

with the correlation between the efficiencies taken into account: 



s~i / _/ \ ri I 1 corr I corr _corr\ /~i i I uncorr i uncorr 

Gie^Ei.ai) = G(£j | e 4 ,a l J • G{e { \e i ,a 



uncorr \ 



The likelihood is evaluated with a Monte Carlo technique ||1Q|| . The mean number /i« 
of events for each experiment is calculated from the relative luminosities and efficiencies, 
with the latter drawn from Gaussian distributions with a common width for the correlated 
uncertainties, and individual widths for the uncorrelated ones. The number of observed 
events v! i for each experiment is drawn from a Poisson distribution with mean /ij. By 
generating a large number of experiments, we calculate the fraction with v! i less or equal 
to the observed events (zero in this case) for the different values of \i. The 95% upper limit 
corresponds to the /x for which this fraction is < 0.05. With the values of the efficiencies 
discussed above, we obtain a 95% cross section limit of 0.0391 pb, which corresponds 
to a lower mass limit of 241.7 GeV for scalar leptoquarks. This calculation does not 
take into account backgrounds in each experiment, which in the case of zero observed 
events is equivalent to a proper background subtraction technique. Limits obtained by the 
frequentist method are in a good agreement with those from the Bayesian prescription. 



5 Conclusions 

We performed proper statistical combinations of the D0 M and CDF |2|] limits on the 
first generation scalar leptoquarks, using both Bayesian and frequentist approaches, with 
common systematic uncertainties taken into account. The two methods are in a good 
agreement. Our combined upper 95% CL cross section limit for leptoquark pair production 
is 38 fb, which corresponds to a lower LQ mass limit of 242 GeV, based on the lower band 
of the NLO calculations . The cross section limits from both experiments, as well as the 
combined result, are shown in Fig. ||], together with the theoretical predictions. This is the 
most stringent limit on the mass of first generation scalar leptoquarks to date. 
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Figure 3: 95% CL upper cross section limits from CDF (triangles), D0 (inverted triangles), 
and combined (squares) leptoquark analyses. The band shows the NLO theoretical cross 
section; arrows correspond to the respective 95% CL lower mass limits. 
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